Correlation Clustering with Noisy Partial Information
نویسندگان
چکیده
In this paper, we propose and study a semi-random model for the Correlation Clustering problem on arbitrary graphs G. We give two approximation algorithms for Correlation Clustering instances from this model. The first algorithm finds a solution of value (1 + δ) opt-cost +Oδ(n log n) with high probability, where opt-cost is the value of the optimal solution (for every δ > 0). The second algorithm finds the ground truth clustering with an arbitrarily small classification error η (under some additional assumptions on the instance).
منابع مشابه
Clustering with Noisy Queries
In this paper, we initiate a rigorous theoretical study of clustering with noisy queries (or a faulty oracle). Given a set of $n$ elements, our goal is to recover the true clustering by asking minimum number of pairwise queries to an oracle. Oracle can answer queries of the form :"do elements $u$ and $v$ belong to the same cluster?"-- the queries can be asked interactively (adaptive queries), o...
متن کاملAn Efficient Framework for Accurate Arterial Input Selection in DSC-MRI of Glioma Brain Tumors
Introduction: Automatic arterial input function (AIF) selection has an essential role in quantification of cerebral perfusion parameters. The purpose of this study is to develop an optimal automatic method for AIF determination in dynamic susceptibility contrast magnetic resonance imaging (DSC-MRI) of glioma brain tumors by using a new preprocessing method.Material and Methods: For this study, ...
متن کاملEvaluating the Improvement of Partial Discharge Localization Accuracy Using Frequency Response Assurance Criterion
Partial Discharge (PD) is the most important source of insulation degradation in power transformers. In order to prevent catastrophic failures in transformers, PDs need to be located as soon as possible so that maintenance measures can be taken in time. Due to the structural complexity of windings, locating the PD source inside a transformer winding is not a simple task. In this paper, the effi...
متن کاملAdaptive Fuzzy C-means Algorithm with Spatial Information for Image Segmentation
This paper considers the problem of partitioning noisy images into different regions by fuzzy clustering approach. Based on two fuzzy c-means (FCM) algorithms (FCM S1 and FCM S2), we propose four adaptive algorithms (FCM S11, FCM S12, FCM S21 and FCM S22) which utilize the high correlation of image pixels to increase the algorithms’ robustness to noise. Unlike existing algorithms, our algorithm...
متن کاملSemi-supervised cross-entropy clustering with information bottleneck constraint
In this paper, we propose a semi-supervised clustering method, CECIB, that models data with a set of Gaussian distributions and that retrieves clusters based on a partial labeling provided by the user (partition-level side information). By combining the ideas from cross-entropy clustering (CEC) with those from the information bottleneck method (IB), our method trades between three conflicting g...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015